Towards a Unified Framework

Author

  • Xiaolong Mou
Abstract

Conversational interfaces have received much attention as a promising natural communication channel between humans and computers. A typical conversational interface consists of three major systems: speech understanding, dialog management, and spoken language generation. In such an interface, speech recognition, as the front end of speech understanding, remains one of the fundamental challenges in establishing robust and effective human-computer communication. On the one hand, the speech recognition component operates in a rich system environment: diverse sources of knowledge are available and can potentially improve its robustness and accuracy. For example, the natural language understanding component can provide syntactic and semantic knowledge that helps constrain the recognition search space. On the other hand, the speech recognition component also faces the challenge of spontaneous speech, and it is important to address the casualness of such speech using the available knowledge sources. For example, sub-lexical linguistic information would be very useful in providing linguistic support for previously unseen words, and dynamic reliability modeling may help improve recognition robustness for poorly articulated speech. In this thesis, we focused mainly on the integration of knowledge sources within the speech understanding system of a conversational interface. More specifically, we studied the formalization and integration of hierarchical linguistic knowledge at both the sub-lexical and supra-lexical levels, and proposed a unified framework for integrating hierarchical linguistic knowledge into speech recognition using layered finite-state transducers (FSTs). Within the proposed framework, we developed context-dependent hierarchical linguistic models at both the sub-lexical and supra-lexical levels.
FSTs were designed and constructed to encode both the structure and the probability constraints provided by the hierarchical linguistic models. We also studied empirically the feasibility and effectiveness of integrating hierarchical linguistic knowledge into speech recognition using the proposed framework. We found that, at the sub-lexical level, hierarchical linguistic modeling is effective in providing generic sub-word structure and probability constraints. Since such constraints are not restricted to a fixed system vocabulary, they can help the recognizer correctly identify previously unseen words. Together with the unknown-word support from natural language understanding, a conversational interface would be able to handle unknown words better, and could possibly incorporate them into the active recognition vocabulary on the fly. At the


Similar resources

$L$-enriched topological systems---a common framework of $L$-topology and $L$-frames

Employing the notions of the strong $L$-topology introduced by Zhang and the $L$-frame introduced by Yao, and the concept of $L$-enriched topological system defined in the present paper, we construct adjunctions among the categories St$L$-Top of strong $L$-topological spaces, S$L$-Loc of strict $L$-locales and $L$-EnTopSys of $L$-enriched topological systems. All of these concepts are ...


An Application of the Logit Model to the Analysis of Informal Sector Activities

This paper reports the results of research carried out during 1993-94 that studied street selling as an informal sector activity and a source of socio-economic problems in Shiraz, Iran. The study uses two Logit models to present a unified framework depicting the factors that seem to affect the development of informal sector activities. The first model looks at thos...


The Role of Computer Anxiety in Acceptance of Iranian Public Library Management System Based on the Unified Theory of Acceptance and Use of Technology

Purpose: The main purpose of this study was to measure the acceptance and role of computer anxiety among the users of public libraries in Kerman province while using Iranian Public Library Management System (SAMAN) within the framework of the unified theory of acceptance and use of technology (UTAUT). Method: This is an applied study in terms of purpose and a descriptive study conducted using ...


A Multi-Formalism Modeling Framework: Formal Definitions, Model Composition and Solution Strategies

In this paper, we present a multi-formalism modeling framework (abbreviated as MFMF) for modeling and simulation. The proposed framework is defined based on the concepts of meta-models and uses object orientation to manage complexity and enhance extensibility. The framework can be used as a basis for modeling with various formalisms and to support model composition in a unified man...





Publication date: 2014